Finding Topic Words for Hierarchical Summarization (DRAFT)

نویسندگان

  • Dawn Lawrie
  • W. Bruce Croft
  • Arnold Rosenberg
چکیده

! "$#% & ' ( *) ' ,+! ./. *0 ( *) ' ,+ 1"2 ! 3) / 4#% . *) ' ,5768 2) :91 9 :;< =" ?>1 ! ./. ( *) @ ) ./ A #B 94 ' C) D ' 1 ' E./ F"4 1"E ! =) G" > H) ' E) I49 = / ?;J) LKM N#% N 4) 0 . *) HOE *) ) '9 = FO2 9 9 HO4 / 9 0 ) ?) B ' H) .E+M;: < 9 9 I4 . ) ' #P) :QN . 0 ) SR4 )DTU ' .E5WV< X H) .Y Z/ M) HO[ *) ) ? ./ \ " &) = L 1 ]./ F" ^5U_X N './9 3) N ; ) ? KM `) a94 F E./ ?) F" 29 9 ' ! "[#% b ' C)! ) ) 9 ? G 1"4 D ! ! ./9 ) ' " I4 c ? !0 +' ,;< F ,;< "4 ,#d ' "] ! V\e-5 6fQ]e-5 gB 4 , ! H) h ! ; ) )G) & ;i) LKM &9 !#d ./ = ];< G = )!) ? ]) 1

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The TITech Summarization System at TAC-2009

This paper presents the TITech summarization system participating in TAC2009. Specifically, we discuss our results for the Update track. We propose a new method for creating summaries by ordering sentences. After a draft summary is obtained, we conduct agglomerative hierarchical clustering on the sentences of the draft summary based on sentence associativity. Then we use a probabilistic method ...

متن کامل

Topic Model Stability for Hierarchical Summarization

We envisioned responsive generic hierarchical text summarization with summaries organized by topic and paragraph based on hierarchical structure topic models. But we had to be sure that topic models were stable for the sampled corpora. To that end we developed a methodology for aligning multiple hierarchical structure topic models run over the same corpus under similar conditions, calculating a...

متن کامل

Detection of Topic and its Extrinsic Evaluation Through Multi-Document Summarization

This paper presents a method for detecting words related to a topic (we call them topic words) over time in the stream of documents. Topic words are widely distributed in the stream of documents, and sometimes they frequently appear in the documents, and sometimes not. We propose a method to reinforce topic words with low frequencies by collecting documents from the corpus, and applied Latent D...

متن کامل

An Integrated Multi-document Summarization Approach based on Word Hierarchical Representation

This paper introduces a novel hierarchical summarization approach for automatic multidocument summarization. By creating a hierarchical representation of the words in the input document set, the proposed approach is able to incorporate various objectives of multidocument summarization through an integrated framework. The evaluation is conducted on the DUC 2007 data set.

متن کامل

A Hybrid Hierarchical Model for Multi-Document Summarization

Scoring sentences in documents given abstract summaries created by humans is important in extractive multi-document summarization. In this paper, we formulate extractive summarization as a two step learning problem building a generative model for pattern discovery and a regression model for inference. We calculate scores for sentences in document clusters based on their latent characteristics u...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001